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An Addendum to "How Good is PSK for 
Peak-Limited Fading Channels in the Low-SNR 

Regime?" 

Wenyi Zhang, Member, IEEE * 
Abstract 

A proof is provided of the operational achievability of i?i-t by the 
recursive training scheme in [I], for general wide-sense stationary and 
ergodic Rayleigh fading processes. 



1 Introduction 

> 

00 , _ 

^T) I In [I], an issue remaining open is whether the recursive training scheme 

interpretation [1, Sec. HLB] is operational for fading processes with an in- 
finitely long memory length. As commented in [1], the main difficulty in 
, analysis pertains to the effect of error propagation, which cannot be readily 

QQ ' circumvented in order to rigorously establish a channel coding theorem. In 

O ■ this addendum, we provide a proof of the operational achievability of Rj^ by 

the recursive training scheme, for general wide-sense stationary and ergodic 
Rayleigh fading processes, without the m-dependence constraint as further 
imposed in the partial proof in [1]. The present proof exploits certain tech- 
niques in mismatched decoding, and its basic idea differs considerably from 
that in the previous proof for m-dependent fading processes. 

Before proceeding to our proof, we briefly sketch a heuristic reasoning 
which has often been invoked to argue that the recursive training scheme is 
operational for channels with memory, and point out why such a heuristic 
reasoning is insufficient to lead to a mathematically rigorous proof. Con- 
sider the recursive training scheme as illustrated in [Ij Fig. 1] . The heuristic 
reasoning is as follows. For general fading processes that possess infinite 
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memory, some residual correlation remains within each parallel sub-channel 
(PSC) among its K symbols. Due to the ergodicity of the fading process, as 
we let the interleaving depth L grow to infinity, this correlation asymptot- 
ically vanishes and thus each PSC can then be viewed as essentially mem- 
oryless. Meanwhile, as we let the coding blocklength K grow to infinity, 
the channel coding theorem {e.g., [2j) for the L asymptotically memoryless 
PSCs ensures that each PSC can achieve a rate arbitrarily close to the input- 
constrained capacity of a true memoryless fading channel, and consequently 
the arbitrarily reliably decoded symbols can be used as essentially error-free 
training pilots for the recursive channel estimation/decoding procedure. 

Unfortunately, the outlined heuristic reasoning is inherently flawed. It 
does not properly resolve a tension between channel memory decorrelation 
and error propagation. The essentially memoryless PSCs are obtained at the 
cost of indefinitely increasing the interleaving depth L. As L increases, the 
block error probability of decoding each PSC is required to decrease inversely 
proportionally with L, in order to prevent catastrophic error propagation, 
and thus the coding block length K is required to increase correspondingly. 
However, it is unclear how valid the "essentially memoryless" property of 
PSCs is as K scales with L asymptotically, because a formal mathematical 
characterization of such a property would involve a limiting argument that 
fixes a sufficiently large K and subsequently lets L grow large. At this 
point, we have observed a logic loop between the scaling of K and L. So 
even if the heuristic reasoning were indeed correct, it lacks a mathematical 
confirmation, except for fading processes with finite memory, namely, the 
so-called m-dependent fading processes. 

In our proof in this addendum, we break the logic loop by not requiring 
the PSCs be essentially memoryless, and hence relieve the tension between 
channel memory decorrelation and error propagation, since K and L do not 
need to scale with each other. However, without the essentially memoryless 
condition, we can no longer ignore the residual correlation within each PSC. 
Fortunately, the generalized mutual information (GMI) in the theory of 
mismatched decoding (see, e.g., [3] and references therein) provides us an 
alternative way of establishing the achievability of information rates (at 
least) up to the input-constrained capacity of an associated memoryless 
fading channel, as will be elaborated shortly. 



2 



2 Proof 



Consider the recursive training scheme as described in ^ Sec. III.B]. We 
seek to prove that, for an arbitrarily small decoding error rate 6 > and an 
arbitrarily small overall rate loss factor A > 0, there exists a code such that 
the recursive coding scheme achieves the rate (1 — A) • i?rt with an average 
decoding error probability no greater than 6, where R^t is defined by HI 
eqn. (23)]. 

For our purpose, we also consider a virtual system consisting of infinitely 
many PSCs. Among those PSCs, PSC / is a Rayleigh fading channel with 
perfect receive channel state information (CSI), and with independent and 
identically distributed (i.i.d.) phase-shift keying (PSK) inputs of average 
SNR p[l] as defined by [U Eqn. (19)]. Furthermore, the fading process 
within each PSC is memoryless, and the noise samples are also i.i.d. zero- 
mean circular complex Gaussian. By such a construction, the L-average 
capacity, namely the average input-constrained capacity of the first L PSCs 
in this virtual system approaches R^t, as L grows asymptotically large (cf. 
[H Sec. III.A]). In our proof, we hence fix a sufficiently large L such that 
the L-average capacity of the virtual system exceeds (1 — A/2) • R^f 

In order to complete the achievability proof, it suffices to show that PSC 
/ in the recursive training scheme achieves at least the input-constrained 
capacity of memoryless PSC I in the virtual system, for every / from to 
(L - 1). In hght of P Eqn. (18)-(19)] and the one-step MMSE prediction 
in recursive training, we can write the channel equation of PSC I as 

X[l-k] = Hd[l;k]- S[l;k] + Z[l;k], k = l,...,K, (1) 

where the index [/; k] denotes the kth channel use of PSC /. Due to the 
wide-sense stationarity and ergodicity of the original fading process, both 
{Hd[l', ■]} and {Z[l; :]} sequences are wide-sense stationary and ergodic with 
respect to index k. In this addendum, we further normalize them to be of 
unit variance and thus p[l] is the average SNR. However, due to the non- 
negligible residual correlation among channel uses within PSC /, in general 
neither {Hd[l', '■]} nor {Z[l; :]} sequence can be a circular complex Gaussian 
process. Fortunately, we have and shall utilize a "locally" Gaussian property 
of ([I]). That is, for each k, Hd[l;k] and Z[l;k] are jointly Gaussian and in 
fact independent, as shown in pTj Sec. III.A]. 

Let us evaluate the GMI of ([1]) following [3, Sec. III]. For a given code- 
book {sm[l', ■]}m=i ^^'^ ^ given realization of received channel outputs x[l;:] 
along with channel fading process hd[l;:], a mismatched channel decoder 
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chooses the message m that minimizes the metric 



D(m) 



1 ^ 



k=l 



x[l;k] - Jp[l] ■ hd[l;k] ■ Sm[l;k] 



That is, the decoder treats the PSC as if it is were memoryless Rayleigh fad- 
ing with i.i.d. circular complex Gaussian noise. Consider the ensemble of all 
codebooks i.i.d. generated by a prescribed PSK constellation, and without 
loss of generality assume the first message is selected with its corresponding 
codeword transmitted. First, due to ergodicity, the metric D(l) converges 
almost surely to unity, since 



T 



lim D{1) 

K—too 



1 



K 



lim —y^\Z[l;k]\ 



k=l 



B{\Z[l;kf} 



1, 



a.s. 



(3) 



The probability that an incorrect codeword accumulates a metric smaller 
than unity decays exponentially in K, and the GMI is just the exponent 
given by 



^GMi = sup(/i - A(//)). 

Ai<0 



(4) 



The limiting log-moment generating function A(/i) = lim/<^oo -j^^k{K p) is 
induced via 



A^(/i) = logE {e'^^(-')| :], :]} , 



(5) 



for m' > 1. Because the inputs within each codeword are i.i.d. PSK symbols, 
conditioning upon :] and Hd^] :]) we have 



K 



K 



k=l 



^l\x[l■,k]-yJp[^\■H^^,[l■,k]■s 



X[l-k],Hd[l-k]\. (6) 



where the expectation is with respect to the PSK constellation S. For con- 
creteness, let us denote the PSK constellation by 5 € {0j}^j^i with each 
constellation point selected equiprobably. Hence we can further write ([6]) as 



lA^(i^^) = -iElogE, 

k=i j=i 



^^l\x[l■,k]~J p[i]-ej-Hi[hk] 



log J. 



(7) 
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Due to ergodicity, for every /x < within a certain finite neighborhood of 
zero, A()u) converges almost surely to an expectation, as 

A{n) = lim ^Kk{Kh) 

A— >oo A 

= E|logf^^e'^l^-V^-^^-^'^r| -logJ, a.s., (8) 

where the expectation is with respect to the fading coefficient ~ CAA(0, 1) 
and the channel output corresponding to the first message, X = \/ p\l] ■ Ha ■ 
Si + Z with Z CM{0, 1) independent of H^- Note that the probability 
density function of X conditioned upon is 

fx|^,(-IM = jE^e-|-VM^-^^-^ar. (9) 

So by letting /j, = —1, we have 

/gmi > -l-A(-l) 

= -l-log7r-E{E{logp^l^^(X|^d)}|^d} 

= hiX\HA)-hiZ) 

= h{X\HA)-h{X\Si,HA)=I{Si;X\HA), (10) 

which is precisely the input-constrained capacity of memoryless PSC I in the 
virtual system. Hence as we choose a sufficiently large coding blocklength 
K such that for every / from to (L — 1), there exist a codebook and 
an associated mismatched channel decoder that achieve a fraction of (1 — 
A)/(l — A/2) of the GMI of PSC I with an average decoding error probability 
no greater than S/L, the entire recursive training scheme achieves the rate 
(1 — A) • i?rt with an average decoding error probability no greater than 6, 
following a standard union upper bounding argument. This concludes our 
proof. 
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